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We discuss selected topics in the forthcoming MSTW 2008 determination of parton 
distributions by global analysis. The tolerance parameter controlling the uncertainties 
on the parton distributions is now determined by a new dynamic procedure for each 
eigenvector of the covariance matrix. New data sets fitted include Tevatron Run II 
data on inclusive jet production, the lepton charge asymmetry from W decays and the 
Z rapidity distribution. Predictions are given for the total W and Z cross sections at 
the Tevatron and LHC. 

This talk [I] covers recent developments in the imminent MSTW 2008 global analysis, 
which is a major update to the currently available MRST 2001 LO g], MRST 2004 NLO [3] 
and MRST 2006 NNLO g] parton distribution functions (PDFs). 

Much effort has been devoted in recent years towards providing PDFs with uncertainties 
obtained by propagating the experimental errors on the data sets included in the global 
fit. The most widely used procedure is the Hessian method pioneered by the CTEQ group. 
The Hessian matrix H is the matrix of second derivatives of the Xgio^ai with respect to 
the PDF parameters. The covariance matrix C = is diagonalised to give a set of 

eigenvalues Aj. and orthonormal eigenvectors vn^. The CTEQ and MRST fitting groups 
have produced eigenvector PDF sets with parameters {at} shifted from the values at 
the global minimum: ai{S^) = a° ± t^/XkVik, with t adjusted to give the desired tolerance 
T = (AXgiobai)^^^- Then PDF users can calculate uncertainties on a quantity F{{ai}) with 

(AF)'^ ~ (1/4) J2k [-^(^k) ^ ^("^aT)]^' "-"^ using a slightly more complicated formula to give 
asymmetric uncertainties. 

Ideally, with the standard parameter-fitting criterion, we would expect the errors to be 
given by the choice of tolerance T = 1 for the 68% (1-cr) confidence level (C.L.) limit, 
or T = -\/2.71 for the 90% C.L. limit. This is appropriate if fitting consistent data sets 
with ideal Gaussian errors to a well-defined theory. However, in practice, there arc minor 
inconsistencies between the independent fitted data sets, and unknown experimental and 
theoretical uncertainties, so this criterion is not appropriate for global PDF analyses. In- 
stead, the much weaker hypothesis-testing criterion has been used by the CTEQ and MRST 
groups, such that the eigenvector PDF sets are treated as alternative hypotheses. The tol- 
erance is then determined from the condition that each data set should be described within 
its 90% C.L. limit, giving approximate values of T = 10 (CTEQ) or T = ^50 (MRST), 
instead of the canonical parameter-fitting value of T = \/2.71 for the 90% C.L. limit. 

The choice of tolerance has been studied more quantitatively in the new MSTW 2008 
analysis. The 90% C.L. region for each data set n (with Nn data points) is defined as 
Xn < (x^ o/?5o) ^90, where Xno value of the goodness-of-fit measure for data set n 

at the global minimum, ^gg is the 90th percentile of the ^^-distribution with Nn degrees of 
freedom, and ^50 — Nn is the most probable value of the x^-distribution with A^„ degrees 
of freedom. The tolerance is then determined separately for each eigenvector direction to 
ensure that all data sets are described within their 90% C.L. limits. 
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Figure 1: The value of the tolerance parameter, T ~ (A^gj^i^j^j)^/^, determined dynamically 
for each eigenvector direction from the condition that each data set must be described within 
its 90% C.L. limit (outer error bars) or 68% C.L. limit (inner error bars). The labels give 
the name of the data set which sets the 90% C.L. tolerance for each eigenvector direction. 



The MS PDF parameterisation at the input scale Qq = 1 GeV^ is taken as: 

xUy=Aux'^^{l~x)'^^{l + euVx + lux), (1) 

xd,=Adx'^3{l~x)'n4{l + edV^ + ldx), (2) 

xS = 2xu + 2xd + xs + xs = A5'a;'^^(l - x)''^S{l + €3 Vx + -fsx), (3) 

xA = xd~ xu = Aa (1 - a;)''^+^(l + 7a a; + ^ a x^), (4) 

xg = Ag x^a (1 - x)^9 (1 + eg + 7s a;) + Ag, x^a' (1 - x)^9' , (5) 

xs + xs = A+ x^"" (1 - .t)^+ (1 + es + 7s x), (6) 

xs-xs = A- x^- (1 - x)^- (1 - x/xo). (7) 



The parameters A^, Ad, Ag and xq are fixed by enforcing number- and momentum-sum 
rule constraints, while the other parameters are allowed to go free. The 20 highlighted 
parameters are those allowed to go free when producing the eigenvector PDF sets, where 
the other parameters are fixed. This is to be compared with the 15 parameters for the 
MRST eigenvector PDF sets. The 5 new parameters include an additional 4 parameters 
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associated with the new strangeness degrees of freedom, ([6]) and ([7]), warranted by the 
inclusion of NuTeV/CCFR dimuon cross sections, and an additional free parameter in the 
input gluon distribution ([5]) which allows more flexibility. The tolerance for each of the 20 
eigenvector directions determined dynamically using the above procedure is shown in Fig. [TJ 
The tolerance is generally close to the MRST value of VbO; however, it is significantly smaller 
for some eigenvectors, such as those associated with the strange quark parameters, which 
are mainly determined only from the NuTeV/CCFR dimuon cross sections. 

The treatment of jet data has been improved with respect to the MRST 2001-2006 
analyses, where six pseudogluon points at = 2000 GeV^ inferred from Tevatron Run 
I inclusive jet data were fitted, rather than the original data points. The MSTW 2008 
analysis uses the f astNLO package to fit the Tevatron Run II and HERA DIS inclusive jet 
data points, including a complete treatment of the correlated systematic errors. At NNLO, 
two-loop threshold corrections are included for Tevatron jet data, and the HERA DIS jet 
data arc excluded. These developments were reported at DIS 2007 [5], where a fit including 
CDF Run II jet data was presented. Recently published D0 Run II jet data have now 
been included in the fit. The Run II jet data prefer a smaller gluon distribution at high 
X compared to the previous Run I data, as seen in Fig. HJa) which shows the ratio of the 
MRST 2006 NNLO gluon distribution to the new NNLO gluon at = 10" GeV^. 

New electroweak data from the Tevatron Run II are now included in the fit. The lepton 
charge asymmetry has been measured in W ^ ev decays in two bins by CDF, and in 
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Total W and Z cross sections 


Bi„ ■ aw (nb) 


Bi^i- ■ az (nb) 


MSTW 2008 NLO (prel.) 
MSTW 2008 NNLO (prel.) 


20.45 (2.650) 
21.44 (2.739) 


1.965 (0.2425) 
2.043 (0.2512) 



Ratio to MSTW 2008 (prel.) 


aw 


az 


MRST 2004 NLO 
MRST 2004 NNLO [3] 


0.974 (0.990) 
0.936 (0.991) 


0.982 (1.000) 
0.940 (1.003) 


MRST 2006 NLO (unpublished) 
MRST 2006 NNLO [5] 


1.002 (0.995) 
0.995 (1.004) 


1.009 (1.001) 
1.001 (1.010) 


CTEQ6.6 NLO 


1.019 (0.978) 


1.022 (0.987) 



Table 1: Predictions for the total W and Z cross sections at the LHC (Tevatron) multiplied 
by the appropriate branching ratio and calculated in the narrow-width approximation using 
the PDG 2006 electroweak parameters. 

W ^ [Lv decays in one bin by D0. D0 will soon have a measurement of the lepton charge 
asymmetry from W ev decays in two p|n bins, corrected for detector effects, which will be 
included in the final MSTW 2008 fit. CDF and D0 data on the Z/7* rapidity distributions 
are also now included in the fit. The new electroweak data mainly constrain the down quark 
distribution since the up quark distribution is already well constrained by structure function 
data where it appears charge-weighted. The ratios of the previous MRST 2006 NNLO up 
and down quark distributions to those from the new fit are shown in Figs. [2ljb,c). 

The inclusion of NuTcV/CCFR dimuon cross sections allows a more flexible param- 
eterisation of the strange and antistrange distributions, given by ([S]) and ([7]), instead of 
the previous MRST assumption that 5(2:, Qq) = s(a;, Qq) — f [i*(a;, Qq) + '^(^1 Qo)] ' with 
K K, 0.5. The dimuon data prefer a suppression of the strange sea at large x compared to the 
non-strange sea, as seen in Fig.[2][d). The new s and s distributions have a larger uncertainty 
than previously since they are no longer tied to u and d at the input scale. The best fit 
gives a non-zero strange sea asymmetry, which is, however, consistent with zero within the 
90% C.L. limit uncertainty band. 

Predictions for the total W and Z cross sections at the Tevatron and LHC are presented 
in Table [U together with the ratio of predictions using some other recent PDF sets to those 
from MSTW 2008. Note that the change in the total W and Z cross sections going from 
MRST 2004 to MRST 2006 was due to an improvement in the heavy flavour prescription [3] , 
while the predictions are relatively stable in going from MRST 2006 to MSTW 2008. 

We will soon have publically available LO, NLO and NNLO parton distributions, each 
with 40 additional eigenvector PDF sets, in time for the LHC start-up. 
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